Overview
Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 712 |
| Missing cells | 550 |
| Missing cells (%) | 3.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 122.5 KiB |
| Average record size in memory | 176.2 B |
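The two derived figures in this table can be reproduced from the other rows. The sketch below assumes that the missing-cell percentage is taken over every cell (variables times observations) and that 1 KiB is 1024 bytes. The variable names are illustrative and do not come from the report.

```python
# Counts copied from the "Dataset statistics" table.
n_variables = 22
n_observations = 712
missing_cells = 550
total_size_kib = 122.5

# Assumption: the percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions both values agree with the table after rounding to one decimal place.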
Variable types
| Numeric | 7 |
|---|---|
| Categorical | 12 |
| Text | 3 |
Age is highly overall correlated with AgeGroup | High correlation |
AgeGroup is highly overall correlated with Age | High correlation |
Deck is highly overall correlated with Pclass | High correlation |
Embarked is highly overall correlated with EmbarkedClass | High correlation |
EmbarkedClass is highly overall correlated with Embarked and 3 other fields | High correlation |
FamilyClass is highly overall correlated with FamilySize and 3 other fields | High correlation |
FamilySize is highly overall correlated with FamilyClass and 4 other fields | High correlation |
Fare is highly overall correlated with FamilySize | High correlation |
FareGroup is highly overall correlated with IsAlone and 2 other fields | High correlation |
GenderClass is highly overall correlated with EmbarkedClass and 5 other fields | High correlation |
IsAlone is highly overall correlated with FamilyClass and 4 other fields | High correlation |
Parch is highly overall correlated with FamilyClass and 2 other fields | High correlation |
Pclass is highly overall correlated with Deck and 4 other fields | High correlation |
Sex is highly overall correlated with GenderClass and 3 other fields | High correlation |
SibSp is highly overall correlated with FamilyClass and 2 other fields | High correlation |
Survived is highly overall correlated with GenderClass and 3 other fields | High correlation |
Title is highly overall correlated with GenderClass and 3 other fields | High correlation |
TitleClass is highly overall correlated with EmbarkedClass and 6 other fields | High correlation |
Deck is highly imbalanced (57.3%) | Imbalance |
Cabin has 550 (77.2%) missing values | Missing |
PassengerId is uniformly distributed | Uniform |
PassengerId has unique values | Unique |
Name has unique values | Unique |
SibSp has 480 (67.4%) zeros | Zeros |
Parch has 543 (76.3%) zeros | Zeros |
Fare has 11 (1.5%) zeros | Zeros |
Reproduction
| Analysis started | 2025-08-22 14:26:54.269261 |
|---|---|
| Analysis finished | 2025-08-22 14:27:09.740816 |
| Duration | 15.47 seconds |
| Software version | ydata-profiling v4.16.1 |
| Download configuration | config.json |
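As a quick consistency check, the reported duration should equal the difference between the two timestamps. This is a minimal sketch using only the standard library; the format string is an assumption based on how the timestamps are printed above.

```python
from datetime import datetime

# Assumed timestamp layout, matching the "Reproduction" table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2025-08-22 14:26:54.269261", FMT)
finished = datetime.strptime("2025-08-22 14:27:09.740816", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```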
Variables
PassengerId
Real number (ℝ)
Uniform  Unique 
| Distinct | 712 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 356.5 |
| Minimum | 1 |
| Maximum | 712 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 36.55 |
| Q1 | 178.75 |
| median | 356.5 |
| Q3 | 534.25 |
| 95-th percentile | 676.45 |
| Maximum | 712 |
| Range | 711 |
| Interquartile range (IQR) | 355.5 |
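Because PassengerId is unique and strictly increasing from 1 to 712, its quantiles can be checked by hand. The sketch below assumes linear interpolation between order statistics, which `statistics.quantiles` provides with `method="inclusive"`. The report does not say which interpolation it uses, so treat the match as a consistency check only.

```python
import statistics

# PassengerId takes every integer from 1 to 712 exactly once.
ids = list(range(1, 713))

# Quartiles with linear interpolation over the observed range.
q1, median, q3 = statistics.quantiles(ids, n=4, method="inclusive")

# Cutting into 20 groups yields the 5th, 10th, ..., 95th percentiles.
ventiles = statistics.quantiles(ids, n=20, method="inclusive")
p5, p95 = ventiles[0], ventiles[-1]

iqr = q3 - q1
value_range = max(ids) - min(ids)

print(p5, q1, median, q3, p95, value_range, iqr)
```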
Descriptive statistics
| Standard deviation | 205.68098 |
|---|---|
| Coefficient of variation (CV) | 0.57694525 |
| Kurtosis | -1.2 |
| Mean | 356.5 |
| Median Absolute Deviation (MAD) | 178 |
| Skewness | 0 |
| Sum | 253828 |
| Variance | 42304.667 |
| Monotonicity | Strictly increasing |
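The descriptive statistics for PassengerId can be rebuilt in the same way. This sketch assumes the variance is the sample variance (n - 1 denominator) and that MAD means the median of absolute deviations from the median. Both assumptions are consistent with the figures in the table but are not stated in the report.

```python
import statistics

ids = list(range(1, 713))

total = sum(ids)
mean = statistics.mean(ids)
variance = statistics.variance(ids)   # sample variance, n - 1 denominator
std_dev = statistics.stdev(ids)       # square root of the sample variance
cv = std_dev / mean                   # coefficient of variation

# Median absolute deviation around the median.
median = statistics.median(ids)
mad = statistics.median(abs(x - median) for x in ids)

print(total, mean, round(variance, 3), round(std_dev, 5), round(cv, 8), mad)
```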
| Value | Count | Frequency (%) |
| 712 | 1 | 0.1% |
| 1 | 1 | 0.1% |
| 2 | 1 | 0.1% |
| 3 | 1 | 0.1% |
| 4 | 1 | 0.1% |
| 5 | 1 | 0.1% |
| 6 | 1 | 0.1% |
| 7 | 1 | 0.1% |
| 696 | 1 | 0.1% |
| 695 | 1 | 0.1% |
| Other values (702) | 702 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 712 | 1 | |
| 711 | 1 | |
| 710 | 1 | |
| 709 | 1 | |
| 708 | 1 | |
| 707 | 1 | |
| 706 | 1 | |
| 705 | 1 | |
| 704 | 1 | |
| 703 | 1 |
Survived
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 10.438202 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | not survived |
|---|---|
| 2nd row | survived |
| 3rd row | survived |
| 4th row | survived |
| 5th row | not survived |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| not survived | 434 | 61.0% |
| survived | 278 | 39.0% |
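The length statistics for Survived follow directly from these two counts, because each label has a fixed number of characters. A minimal sketch (the dictionary name is illustrative):

```python
# Label counts copied from the "Common Values" table.
label_counts = {"not survived": 434, "survived": 278}

n_rows = sum(label_counts.values())

# Total characters across all rows, then the average per row.
total_chars = sum(len(label) * n for label, n in label_counts.items())
mean_length = total_chars / n_rows

print(n_rows, total_chars, round(mean_length, 6))
```

The character total matches the count shown under "Most occurring categories", and the mean matches the "Mean length" row.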
Words
| Value | Count | Frequency (%) |
| survived | 712 | |
| not | 434 |
Most occurring characters
| Value | Count | Frequency (%) |
| v | 1424 | |
| s | 712 | |
| r | 712 | |
| e | 712 | |
| i | 712 | |
| d | 712 | |
| u | 712 | |
| t | 434 | 5.8% |
| o | 434 | 5.8% |
| n | 434 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7432 |
Pclass
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 1 |
| 3rd row | 3 |
| 4th row | 1 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 3 | 390 | 54.8% |
| 1 | 175 | 24.6% |
| 2 | 147 | 20.6% |
Words
| Value | Count | Frequency (%) |
| 3 | 390 | |
| 1 | 175 | |
| 2 | 147 | 20.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 390 | |
| 1 | 175 | |
| 2 | 147 | 20.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 712 |
Name
Text
Unique 
| Distinct | 712 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
Length
| Max length | 82 |
|---|---|
| Median length | 53 |
| Mean length | 27.015449 |
| Min length | 12 |
Unique
| Unique | 712 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Braund, Mr. Owen Harris |
|---|---|
| 2nd row | Cumings, Mrs. John Bradley (Florence Briggs Thayer) |
| 3rd row | Heikkinen, Miss. Laina |
| 4th row | Futrelle, Mrs. Jacques Heath (Lily May Peel) |
| 5th row | Allen, Mr. William Henry |
| Value | Count | Frequency (%) |
| mr | 417 | 14.4% |
| miss | 155 | 5.4% |
| mrs | 100 | 3.5% |
| william | 53 | 1.8% |
| john | 36 | 1.2% |
| henry | 31 | 1.1% |
| master | 28 | 1.0% |
| charles | 21 | 0.7% |
| james | 20 | 0.7% |
| george | 18 | 0.6% |
| Other values (1262) | 2014 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 2182 | 11.3% |
| r | 1561 | 8.1% |
| e | 1354 | 7.0% |
| a | 1326 | 6.9% |
| i | 1089 | 5.7% |
| n | 1048 | 5.4% |
| s | 1043 | 5.4% |
| M | 907 | 4.7% |
| l | 860 | 4.5% |
| o | 798 | 4.1% |
| Other values (50) | 7067 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 19235 |
Sex
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.7191011 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | male |
|---|---|
| 2nd row | female |
| 3rd row | female |
| 4th row | female |
| 5th row | male |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| male | 456 | 64.0% |
| female | 256 | 36.0% |
Words
| Value | Count | Frequency (%) |
| male | 456 | |
| female | 256 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 968 | |
| m | 712 | |
| a | 712 | |
| l | 712 | |
| f | 256 | 7.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3360 |
Age
Real number (ℝ)
High correlation 
| Distinct | 84 |
|---|---|
| Distinct (%) | 11.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.369031 |
| Minimum | 0.75 |
| Maximum | 80 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 KiB |
Quantile statistics
| Minimum | 0.75 |
|---|---|
| 5-th percentile | 5.5 |
| Q1 | 21 |
| median | 26 |
| Q3 | 37 |
| 95-th percentile | 55 |
| Maximum | 80 |
| Range | 79.25 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 13.578078 |
|---|---|
| Coefficient of variation (CV) | 0.46232639 |
| Kurtosis | 0.51411154 |
| Mean | 29.369031 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.49319024 |
| Sum | 20910.75 |
| Variance | 184.3642 |
| Monotonicity | Not monotonic |
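Two rows of this table are functions of the others: the coefficient of variation is the standard deviation divided by the mean, and the variance is the standard deviation squared. The sketch below recomputes both from the rounded values printed above, so small differences in the last digits are expected.

```python
# Rounded values copied from the Age "Descriptive statistics" table.
mean = 29.369031
std_dev = 13.578078

cv = std_dev / mean        # coefficient of variation
variance = std_dev ** 2    # variance as the square of the standard deviation

print(round(cv, 6), round(variance, 4))
```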
| Value | Count | Frequency (%) |
| 26 | 86 | 12.1% |
| 17.5 | 29 | 4.1% |
| 24 | 28 | 3.9% |
| 42 | 27 | 3.8% |
| 22 | 26 | 3.7% |
| 21 | 21 | 2.9% |
| 19 | 21 | 2.9% |
| 30 | 21 | 2.9% |
| 28 | 21 | 2.9% |
| 18 | 20 | 2.8% |
| Other values (74) | 412 |
| Value | Count | Frequency (%) |
| 0.75 | 2 | 0.3% |
| 0.83 | 1 | 0.1% |
| 0.92 | 1 | 0.1% |
| 1 | 5 | |
| 2 | 9 | |
| 3 | 6 | |
| 4 | 7 | |
| 5 | 3 | 0.4% |
| 5.5 | 4 | |
| 7 | 3 | 0.4% |
| Value | Count | Frequency (%) |
| 80 | 1 | 0.1% |
| 71 | 2 | |
| 70.5 | 1 | 0.1% |
| 70 | 1 | 0.1% |
| 66 | 1 | 0.1% |
| 65 | 3 | |
| 64 | 2 | |
| 63 | 2 | |
| 62 | 3 | |
| 61 | 3 |
SibSp
Real number (ℝ)
High correlation  Zeros 
| Distinct | 7 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.52808989 |
| Minimum | 0 |
| Maximum | 8 |
| Zeros | 480 |
| Zeros (%) | 67.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.0643423 |
|---|---|
| Coefficient of variation (CV) | 2.0154566 |
| Kurtosis | 15.989177 |
| Mean | 0.52808989 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.4428225 |
| Sum | 376 |
| Variance | 1.1328245 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 480 | |
| 1 | 169 | 23.7% |
| 2 | 26 | 3.7% |
| 3 | 14 | 2.0% |
| 4 | 14 | 2.0% |
| 5 | 5 | 0.7% |
| 8 | 4 | 0.6% |
| Value | Count | Frequency (%) |
| 0 | 480 | |
| 1 | 169 | 23.7% |
| 2 | 26 | 3.7% |
| 3 | 14 | 2.0% |
| 4 | 14 | 2.0% |
| 5 | 5 | 0.7% |
| 8 | 4 | 0.6% |
| Value | Count | Frequency (%) |
| 8 | 4 | 0.6% |
| 5 | 5 | 0.7% |
| 4 | 14 | 2.0% |
| 3 | 14 | 2.0% |
| 2 | 26 | 3.7% |
| 1 | 169 | 23.7% |
| 0 | 480 |
Parch
Real number (ℝ)
High correlation  Zeros 
| Distinct | 7 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.38202247 |
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 543 |
| Zeros (%) | 76.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.81312189 |
|---|---|
| Coefficient of variation (CV) | 2.1284661 |
| Kurtosis | 10.19931 |
| Mean | 0.38202247 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.8007415 |
| Sum | 272 |
| Variance | 0.66116721 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 543 | |
| 1 | 92 | 12.9% |
| 2 | 66 | 9.3% |
| 5 | 4 | 0.6% |
| 4 | 4 | 0.6% |
| 3 | 2 | 0.3% |
| 6 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 543 | |
| 1 | 92 | 12.9% |
| 2 | 66 | 9.3% |
| 3 | 2 | 0.3% |
| 4 | 4 | 0.6% |
| 5 | 4 | 0.6% |
| 6 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 6 | 1 | 0.1% |
| 5 | 4 | 0.6% |
| 4 | 4 | 0.6% |
| 3 | 2 | 0.3% |
| 2 | 66 | 9.3% |
| 1 | 92 | 12.9% |
| 0 | 543 |
Ticket
Text
| Distinct | 566 |
|---|---|
| Distinct (%) | 79.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 6.8019663 |
| Min length | 3 |
Unique
| Unique | 470 |
|---|---|
| Unique (%) | 66.0% |
Sample
| 1st row | A/5 21171 |
|---|---|
| 2nd row | PC 17599 |
| 3rd row | STON/O2. 3101282 |
| 4th row | 113803 |
| 5th row | 373450 |
| Value | Count | Frequency (%) |
| pc | 52 | 5.7% |
| c.a | 23 | 2.5% |
| a/5 | 17 | 1.9% |
| 2 | 11 | 1.2% |
| ca | 11 | 1.2% |
| ston/o | 11 | 1.2% |
| sc/paris | 8 | 0.9% |
| soton/o.q | 7 | 0.8% |
| w./c | 6 | 0.7% |
| 2144 | 6 | 0.7% |
| Other values (591) | 763 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 607 | |
| 1 | 569 | |
| 2 | 465 | |
| 7 | 389 | 8.0% |
| 4 | 378 | 7.8% |
| 6 | 330 | 6.8% |
| 0 | 319 | 6.6% |
| 5 | 309 | 6.4% |
| 9 | 266 | 5.5% |
| 8 | 216 | 4.5% |
| Other values (25) | 995 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4843 |
Fare
Real number (ℝ)
High correlation  Zeros 
| Distinct | 228 |
|---|---|
| Distinct (%) | 32.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32.509538 |
| Minimum | 0 |
| Maximum | 512.3292 |
| Zeros | 11 |
| Zeros (%) | 1.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 7.225 |
| Q1 | 7.925 |
| median | 15.0229 |
| Q3 | 31.275 |
| 95-th percentile | 113.275 |
| Maximum | 512.3292 |
| Range | 512.3292 |
| Interquartile range (IQR) | 23.35 |
Descriptive statistics
| Standard deviation | 48.67271 |
|---|---|
| Coefficient of variation (CV) | 1.4971825 |
| Kurtosis | 31.311056 |
| Mean | 32.509538 |
| Median Absolute Deviation (MAD) | 7.3833 |
| Skewness | 4.5799705 |
| Sum | 23146.791 |
| Variance | 2369.0327 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8.05 | 40 | 5.6% |
| 13 | 30 | 4.2% |
| 7.8958 | 29 | 4.1% |
| 26 | 27 | 3.8% |
| 7.75 | 27 | 3.8% |
| 10.5 | 19 | 2.7% |
| 26.55 | 14 | 2.0% |
| 7.925 | 14 | 2.0% |
| 7.25 | 12 | 1.7% |
| 7.8542 | 11 | 1.5% |
| Other values (218) | 489 |
| Value | Count | Frequency (%) |
| 0 | 11 | |
| 4.0125 | 1 | 0.1% |
| 6.2375 | 1 | 0.1% |
| 6.4958 | 2 | 0.3% |
| 6.75 | 2 | 0.3% |
| 6.8583 | 1 | 0.1% |
| 6.975 | 1 | 0.1% |
| 7.0458 | 1 | 0.1% |
| 7.05 | 5 | |
| 7.0542 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 512.3292 | 2 | |
| 263 | 4 | |
| 262.375 | 1 | 0.1% |
| 247.5208 | 2 | |
| 227.525 | 3 | |
| 221.7792 | 1 | 0.1% |
| 211.5 | 1 | 0.1% |
| 211.3375 | 1 | 0.1% |
| 164.8667 | 1 | 0.1% |
| 153.4625 | 3 |
Cabin
Text
Missing 
| Distinct | 122 |
|---|---|
| Distinct (%) | 75.3% |
| Missing | 550 |
| Missing (%) | 77.2% |
| Memory size | 5.7 KiB |
Length
| Max length | 15 |
|---|---|
| Median length | 3 |
| Mean length | 3.5185185 |
| Min length | 1 |
Unique
| Unique | 90 |
|---|---|
| Unique (%) | 55.6% |
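The Cabin percentages use two different denominators, which is easy to miss. The figures are consistent with "Missing (%)" being relative to all 712 rows, while "Distinct (%)" and "Unique (%)" are relative to the 162 non-missing rows. The sketch below checks that reading; it is an inference from the numbers, not something the report states.

```python
# Counts copied from the Cabin tables.
n_rows = 712
missing = 550
distinct = 122
unique = 90
total_chars = 570  # from "Most occurring categories"

non_missing = n_rows - missing

missing_pct = 100 * missing / n_rows          # relative to all rows
distinct_pct = 100 * distinct / non_missing   # relative to non-missing rows
unique_pct = 100 * unique / non_missing
mean_length = total_chars / non_missing

print(non_missing, round(missing_pct, 1), round(distinct_pct, 1),
      round(unique_pct, 1), round(mean_length, 7))
```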
Sample
| 1st row | C85 |
|---|---|
| 2nd row | C123 |
| 3rd row | E46 |
| 4th row | G6 |
| 5th row | C103 |
| Value | Count | Frequency (%) |
| c23 | 4 | 2.1% |
| c25 | 4 | 2.1% |
| c27 | 4 | 2.1% |
| g6 | 4 | 2.1% |
| f33 | 3 | 1.6% |
| c22 | 3 | 1.6% |
| c26 | 3 | 1.6% |
| d | 3 | 1.6% |
| f2 | 3 | 1.6% |
| f | 3 | 1.6% |
| Other values (125) | 153 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 64 | |
| 2 | 59 | 10.4% |
| 3 | 51 | 8.9% |
| 1 | 44 | 7.7% |
| 6 | 41 | 7.2% |
| B | 41 | 7.2% |
| 5 | 34 | 6.0% |
| 4 | 28 | 4.9% |
| 8 | 27 | 4.7% |
| D | 26 | 4.6% |
| Other values (9) | 155 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 570 |
Embarked
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | S |
|---|---|
| 2nd row | C |
| 3rd row | S |
| 4th row | S |
| 5th row | S |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| S | 510 | 71.6% |
| C | 138 | 19.4% |
| Q | 64 | 9.0% |
Words
| Value | Count | Frequency (%) |
| s | 510 | |
| c | 138 | 19.4% |
| q | 64 | 9.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 510 | |
| C | 138 | 19.4% |
| Q | 64 | 9.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 712 |
Title
Categorical
High correlation 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 2 |
| Mean length | 2.7837079 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mr |
|---|---|
| 2nd row | Mrs |
| 3rd row | Miss |
| 4th row | Mrs |
| 5th row | Mr |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Mr | 413 | 58.0% |
| Miss | 155 | 21.8% |
| Mrs | 96 | 13.5% |
| Master | 28 | 3.9% |
| Rare | 20 | 2.8% |
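The report does not say how Title was built, but its sample rows line up with the honorific in each Name sample row. The following is a hypothetical reconstruction: the `extract_title` helper, the regular expression, and the rule that folds every other honorific into "Rare" are all assumptions made for illustration.

```python
import re

# Assumption: these four titles are kept and everything else becomes "Rare".
COMMON_TITLES = {"Mr", "Miss", "Mrs", "Master"}

def extract_title(name: str) -> str:
    """Return the honorific between the comma and the first period."""
    match = re.search(r",\s*([^.,]+)\.", name)
    title = match.group(1).strip() if match else ""
    return title if title in COMMON_TITLES else "Rare"

# The five sample rows shown for the Name variable.
names = [
    "Braund, Mr. Owen Harris",
    "Cumings, Mrs. John Bradley (Florence Briggs Thayer)",
    "Heikkinen, Miss. Laina",
    "Futrelle, Mrs. Jacques Heath (Lily May Peel)",
    "Allen, Mr. William Henry",
]
titles = [extract_title(n) for n in names]
print(titles)
```

For these five names the helper returns the same values as the Title sample rows.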
Words
| Value | Count | Frequency (%) |
| mr | 413 | |
| miss | 155 | 21.8% |
| mrs | 96 | 13.5% |
| master | 28 | 3.9% |
| rare | 20 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 692 | |
| r | 557 | |
| s | 434 | |
| i | 155 | 7.8% |
| a | 48 | 2.4% |
| e | 48 | 2.4% |
| t | 28 | 1.4% |
| R | 20 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1982 |
FamilySize
Real number (ℝ)
High correlation 
| Distinct | 9 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.9101124 |
| Minimum | 1 |
| Maximum | 11 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 6 |
| Maximum | 11 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.5788008 |
|---|---|
| Coefficient of variation (CV) | 0.82654867 |
| Kurtosis | 8.2223885 |
| Mean | 1.9101124 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.597848 |
| Sum | 1360 |
| Variance | 2.4926121 |
| Monotonicity | Not monotonic |
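The FamilySize totals are consistent with the common construction FamilySize = SibSp + Parch + 1 (the passenger plus relatives aboard). The report does not document the derivation, so the sketch below only checks that the published sums agree with that assumption.

```python
# Sums copied from the SibSp, Parch and FamilySize tables.
n_rows = 712
sibsp_sum = 376
parch_sum = 272

# Assumption: FamilySize = SibSp + Parch + 1 for every row,
# so the column sum gains one unit per passenger.
family_size_sum = sibsp_sum + parch_sum + n_rows
family_size_mean = family_size_sum / n_rows

print(family_size_sum, round(family_size_mean, 7))
```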
| Value | Count | Frequency (%) |
| 1 | 422 | |
| 2 | 133 | 18.7% |
| 3 | 86 | 12.1% |
| 4 | 21 | 2.9% |
| 6 | 18 | 2.5% |
| 5 | 12 | 1.7% |
| 7 | 10 | 1.4% |
| 8 | 6 | 0.8% |
| 11 | 4 | 0.6% |
| Value | Count | Frequency (%) |
| 1 | 422 | |
| 2 | 133 | 18.7% |
| 3 | 86 | 12.1% |
| 4 | 21 | 2.9% |
| 5 | 12 | 1.7% |
| 6 | 18 | 2.5% |
| 7 | 10 | 1.4% |
| 8 | 6 | 0.8% |
| 11 | 4 | 0.6% |
| Value | Count | Frequency (%) |
| 11 | 4 | 0.6% |
| 8 | 6 | 0.8% |
| 7 | 10 | 1.4% |
| 6 | 18 | 2.5% |
| 5 | 12 | 1.7% |
| 4 | 21 | 2.9% |
| 3 | 86 | 12.1% |
| 2 | 133 | 18.7% |
| 1 | 422 |
IsAlone
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 422 | 59.3% |
| 0 | 290 | 40.7% |
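These counts match what one would get by flagging passengers whose FamilySize equals 1. That rule is an assumption (the report does not define IsAlone), but it can be checked against the FamilySize value counts:

```python
# Value counts copied from the FamilySize table.
family_size_counts = {1: 422, 2: 133, 3: 86, 4: 21, 5: 12,
                      6: 18, 7: 10, 8: 6, 11: 4}

# Assumption: IsAlone is 1 when FamilySize == 1 and 0 otherwise.
alone = family_size_counts[1]
not_alone = sum(n for size, n in family_size_counts.items() if size > 1)

print(alone, not_alone)
```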
Words
| Value | Count | Frequency (%) |
| 1 | 422 | |
| 0 | 290 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 422 | |
| 0 | 290 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 712 |
Deck
Categorical
High correlation  Imbalance 
| Distinct | 9 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | U |
|---|---|
| 2nd row | C |
| 3rd row | U |
| 4th row | C |
| 5th row | U |
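Deck looks like the first letter of Cabin, with "U" standing in for a missing cabin: the 550 "U" rows equal the 550 missing Cabin values. The helper below is a hypothetical reconstruction under that assumption and is not taken from the report.

```python
from typing import Optional

def deck_from_cabin(cabin: Optional[str]) -> str:
    """Map a cabin string to its deck letter; missing cabins become 'U'."""
    # Assumption: None or an empty string represents a missing cabin.
    if not cabin or not cabin.strip():
        return "U"
    return cabin.strip()[0].upper()

# Cabin values from the Cabin sample rows, plus one missing value.
decks = [deck_from_cabin(c) for c in ["C85", "C123", "E46", "G6", None]]
print(decks)
```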
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| U | 550 | 77.2% |
| C | 52 | 7.3% |
| B | 32 | 4.5% |
| D | 25 | 3.5% |
| E | 24 | 3.4% |
| A | 13 | 1.8% |
| F | 11 | 1.5% |
| G | 4 | 0.6% |
| T | 1 | 0.1% |
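The 57.3% imbalance figure in the alerts can be reproduced from these counts as one minus the normalized Shannon entropy of the class distribution. That this is the score the profiler uses is an assumption; the sketch only shows that the formula yields the reported value for this column.

```python
import math

# Value counts copied from the Deck "Common Values" table.
deck_counts = {"U": 550, "C": 52, "B": 32, "D": 25, "E": 24,
               "A": 13, "F": 11, "G": 4, "T": 1}

total = sum(deck_counts.values())

# Shannon entropy in bits of the observed class distribution.
entropy = -sum((n / total) * math.log2(n / total)
               for n in deck_counts.values())

# Normalize by the maximum entropy (all classes equally likely),
# then invert so that 0 means balanced and 1 means a single class.
imbalance = 1 - entropy / math.log2(len(deck_counts))

print(f"{100 * imbalance:.1f}%")
```

A perfectly balanced nine-class column would score 0%, so the score here is driven almost entirely by the 550 "U" rows.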
Words
| Value | Count | Frequency (%) |
| u | 550 | |
| c | 52 | 7.3% |
| b | 32 | 4.5% |
| d | 25 | 3.5% |
| e | 24 | 3.4% |
| a | 13 | 1.8% |
| f | 11 | 1.5% |
| g | 4 | 0.6% |
| t | 1 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 550 | |
| C | 52 | 7.3% |
| B | 32 | 4.5% |
| D | 25 | 3.5% |
| E | 24 | 3.4% |
| A | 13 | 1.8% |
| F | 11 | 1.5% |
| G | 4 | 0.6% |
| T | 1 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 712 |
AgeGroup
Categorical
High correlation 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 8.5154494 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Young Adult |
|---|---|
| 2nd row | Adult |
| 3rd row | Young Adult |
| 4th row | Young Adult |
| 5th row | Young Adult |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Young Adult | 373 | 52.4% |
| Adult | 183 | 25.7% |
| Teenager | 82 | 11.5% |
| Child | 55 | 7.7% |
| Senior | 19 | 2.7% |
Words
| Value | Count | Frequency (%) |
| adult | 556 | |
| young | 373 | |
| teenager | 82 | 7.6% |
| child | 55 | 5.1% |
| senior | 19 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| u | 929 | |
| d | 611 | |
| l | 611 | |
| A | 556 | |
| t | 556 | |
| n | 474 | |
| g | 455 | |
| o | 392 | |
| Y | 373 | |
| (space) | 373 | |
| Other values (8) | 733 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6063 |
FareGroup
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 6.9410112 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Low |
|---|---|
| 2nd row | High |
| 3rd row | Low |
| 4th row | High |
| 5th row | Medium-Low |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Low | 186 | 26.1% |
| Medium-High | 180 | 25.3% |
| High | 176 | 24.7% |
| Medium-Low | 170 | 23.9% |
Words
| Value | Count | Frequency (%) |
| low | 186 | |
| medium-high | 180 | |
| high | 176 | |
| medium-low | 170 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 706 | |
| o | 356 | 7.2% |
| w | 356 | 7.2% |
| h | 356 | 7.2% |
| L | 356 | 7.2% |
| g | 356 | 7.2% |
| H | 356 | 7.2% |
| e | 350 | 7.1% |
| M | 350 | 7.1% |
| m | 350 | 7.1% |
| Other values (3) | 1050 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4942 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4942 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4942 | 100.0% |
EmbarkedClass
Categorical
High correlation 
| Distinct | 9 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | S3 |
|---|---|
| 2nd row | C1 |
| 3rd row | S3 |
| 4th row | S1 |
| 5th row | S3 |
Common Values
| Value | Count | Frequency (%) |
| S3 | 279 | 39.2% |
| S2 | 131 | 18.4% |
| S1 | 100 | 14.0% |
| C1 | 73 | 10.3% |
| Q3 | 59 | 8.3% |
| C3 | 52 | 7.3% |
| C2 | 13 | 1.8% |
| Q2 | 3 | 0.4% |
| Q1 | 2 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 510 | 35.8% |
| 3 | 390 | 27.4% |
| 1 | 175 | 12.3% |
| 2 | 147 | 10.3% |
| C | 138 | 9.7% |
| Q | 64 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1424 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1424 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1424 | 100.0% |
GenderClass
Categorical
High correlation 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 5 |
| Mean length | 5.7191011 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | male3 |
|---|---|
| 2nd row | female1 |
| 3rd row | female3 |
| 4th row | female1 |
| 5th row | male3 |
Common Values
| Value | Count | Frequency (%) |
| male3 | 269 | 37.8% |
| female3 | 121 | 17.0% |
| male1 | 102 | 14.3% |
| male2 | 85 | 11.9% |
| female1 | 73 | 10.3% |
| female2 | 62 | 8.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 968 | 23.8% |
| m | 712 | 17.5% |
| a | 712 | 17.5% |
| l | 712 | 17.5% |
| 3 | 390 | 9.6% |
| f | 256 | 6.3% |
| 1 | 175 | 4.3% |
| 2 | 147 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4072 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4072 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4072 | 100.0% |
TitleClass
Categorical
High correlation 
| Distinct | 14 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.7 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 3 |
| Mean length | 3.7837079 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mr3 |
|---|---|
| 2nd row | Mrs1 |
| 3rd row | Miss3 |
| 4th row | Mrs1 |
| 5th row | Mr3 |
Common Values
| Value | Count | Frequency (%) |
| Mr3 | 249 | 35.0% |
| Mr1 | 91 | 12.8% |
| Miss3 | 87 | 12.2% |
| Mr2 | 73 | 10.3% |
| Miss1 | 39 | 5.5% |
| Mrs3 | 34 | 4.8% |
| Mrs2 | 32 | 4.5% |
| Mrs1 | 30 | 4.2% |
| Miss2 | 29 | 4.1% |
| Master3 | 20 | 2.8% |
| Other values (4) | 28 | 3.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 692 | 25.7% |
| r | 557 | 20.7% |
| s | 434 | 16.1% |
| 3 | 390 | 14.5% |
| 1 | 175 | 6.5% |
| i | 155 | 5.8% |
| 2 | 147 | 5.5% |
| a | 48 | 1.8% |
| e | 48 | 1.8% |
| t | 28 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2694 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2694 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2694 | 100.0% |
FamilyClass
Real number (ℝ)
High correlation 
| Distinct | 20 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21.40309 |
| Minimum | 11 |
|---|---|
| Maximum | 113 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.7 KiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 13 |
| median | 13 |
| Q3 | 23 |
| 95-th percentile | 61 |
| Maximum | 113 |
| Range | 102 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 15.869161 |
|---|---|
| Coefficient of variation (CV) | 0.74144251 |
| Kurtosis | 8.3934816 |
| Mean | 21.40309 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 2.6305272 |
| Sum | 15239 |
| Variance | 251.83026 |
| Monotonicity | Not monotonic |
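Several of the FamilyClass statistics above are simple functions of the others, so they can be cross-checked by hand. The following is a minimal sketch of that arithmetic using only numbers copied from the tables above; the variable names are illustrative and not part of the report.

```python
# Values copied from the FamilyClass tables in this report.
minimum, maximum = 11, 113
q1, q3 = 13, 23
mean, std = 21.40309, 15.869161
total, n_rows = 15239, 712

value_range = maximum - minimum   # compare with the reported Range (102)
iqr = q3 - q1                     # compare with the reported IQR (10)
cv = std / mean                   # compare with the reported CV (0.74144251)
variance = std ** 2               # compare with the reported Variance (251.83026)
mean_from_sum = total / n_rows    # compare with the reported Mean (21.40309)

print(value_range, iqr)
print(round(cv, 4), round(variance, 2), round(mean_from_sum, 5))
```

Small differences in the last digits are expected because the inputs are themselves rounded in the report.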
Common Values
| Value | Count | Frequency (%) |
| 13 | 251 | 35.3% |
| 11 | 87 | 12.2% |
| 12 | 84 | 11.8% |
| 21 | 59 | 8.3% |
| 23 | 48 | 6.7% |
| 33 | 42 | 5.9% |
| 22 | 26 | 3.7% |
| 32 | 25 | 3.5% |
| 31 | 19 | 2.7% |
| 63 | 13 | 1.8% |
| Other values (10) | 58 | 8.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 11 | 87 | 12.2% |
| 12 | 84 | 11.8% |
| 13 | 251 | 35.3% |
| 21 | 59 | 8.3% |
| 22 | 26 | 3.7% |
| 23 | 48 | 6.7% |
| 31 | 19 | 2.7% |
| 32 | 25 | 3.5% |
| 33 | 42 | 5.9% |
| 41 | 5 | 0.7% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 113 | 4 | 0.6% |
| 83 | 6 | 0.8% |
| 73 | 10 | 1.4% |
| 63 | 13 | 1.8% |
| 62 | 1 | 0.1% |
| 61 | 4 | 0.6% |
| 53 | 11 | 1.5% |
| 51 | 1 | 0.1% |
| 43 | 5 | 0.7% |
| 42 | 11 | 1.5% |
Interactions
Correlations
| | Age | AgeGroup | Deck | Embarked | EmbarkedClass | FamilyClass | FamilySize | Fare | FareGroup | GenderClass | IsAlone | Parch | PassengerId | Pclass | Sex | SibSp | Survived | Title | TitleClass |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Age | 1.000 | 0.808 | 0.167 | 0.109 | 0.164 | -0.356 | -0.221 | 0.174 | 0.228 | 0.252 | 0.378 | -0.258 | 0.090 | 0.307 | 0.291 | -0.189 | 0.217 | 0.418 | 0.320 |
| AgeGroup | 0.808 | 1.000 | 0.185 | 0.194 | 0.252 | 0.286 | 0.274 | 0.090 | 0.218 | 0.292 | 0.379 | 0.284 | 0.075 | 0.314 | 0.273 | 0.257 | 0.132 | 0.432 | 0.496 |
| Deck | 0.167 | 0.185 | 1.000 | 0.218 | 0.313 | 0.049 | 0.000 | 0.290 | 0.356 | 0.393 | 0.189 | 0.000 | 0.000 | 0.598 | 0.159 | 0.000 | 0.294 | 0.137 | 0.339 |
| Embarked | 0.109 | 0.194 | 0.218 | 1.000 | 0.996 | 0.096 | 0.064 | 0.215 | 0.260 | 0.299 | 0.119 | 0.050 | 0.000 | 0.271 | 0.128 | 0.089 | 0.154 | 0.160 | 0.315 |
| EmbarkedClass | 0.164 | 0.252 | 0.313 | 0.996 | 1.000 | 0.123 | 0.101 | 0.292 | 0.477 | 0.636 | 0.142 | 0.000 | 0.000 | 0.996 | 0.167 | 0.093 | 0.339 | 0.180 | 0.515 |
| FamilyClass | -0.356 | 0.286 | 0.049 | 0.096 | 0.123 | 1.000 | 0.907 | 0.200 | 0.311 | 0.175 | 0.830 | 0.714 | -0.061 | 0.210 | 0.240 | 0.771 | 0.199 | 0.278 | 0.241 |
| FamilySize | -0.221 | 0.274 | 0.000 | 0.064 | 0.101 | 0.907 | 1.000 | 0.516 | 0.255 | 0.129 | 0.634 | 0.793 | -0.045 | 0.147 | 0.186 | 0.843 | 0.204 | 0.242 | 0.220 |
| Fare | 0.174 | 0.090 | 0.290 | 0.215 | 0.292 | 0.200 | 0.516 | 1.000 | 0.466 | 0.330 | 0.285 | 0.392 | 0.027 | 0.489 | 0.156 | 0.431 | 0.266 | 0.067 | 0.307 |
| FareGroup | 0.228 | 0.218 | 0.356 | 0.260 | 0.477 | 0.311 | 0.255 | 0.466 | 1.000 | 0.490 | 0.582 | 0.242 | 0.003 | 0.565 | 0.202 | 0.301 | 0.283 | 0.205 | 0.526 |
| GenderClass | 0.252 | 0.292 | 0.393 | 0.299 | 0.636 | 0.175 | 0.129 | 0.330 | 0.490 | 1.000 | 0.294 | 0.102 | 0.068 | 0.998 | 0.997 | 0.114 | 0.616 | 0.513 | 0.985 |
| IsAlone | 0.378 | 0.379 | 0.189 | 0.119 | 0.142 | 0.830 | 0.634 | 0.285 | 0.582 | 0.294 | 1.000 | 0.667 | 0.000 | 0.113 | 0.279 | 0.834 | 0.167 | 0.466 | 0.478 |
| Parch | -0.258 | 0.284 | 0.000 | 0.050 | 0.000 | 0.714 | 0.793 | 0.392 | 0.242 | 0.102 | 0.667 | 1.000 | -0.012 | 0.000 | 0.226 | 0.426 | 0.117 | 0.242 | 0.226 |
| PassengerId | 0.090 | 0.075 | 0.000 | 0.000 | 0.000 | -0.061 | -0.045 | 0.027 | 0.003 | 0.068 | 0.000 | -0.012 | 1.000 | 0.009 | 0.079 | -0.053 | 0.024 | 0.057 | 0.049 |
| Pclass | 0.307 | 0.314 | 0.598 | 0.271 | 0.996 | 0.210 | 0.147 | 0.489 | 0.565 | 0.998 | 0.113 | 0.000 | 0.009 | 1.000 | 0.100 | 0.137 | 0.315 | 0.177 | 0.992 |
| Sex | 0.291 | 0.273 | 0.159 | 0.128 | 0.167 | 0.240 | 0.186 | 0.156 | 0.202 | 0.997 | 0.279 | 0.226 | 0.079 | 0.100 | 1.000 | 0.189 | 0.536 | 0.986 | 0.980 |
| SibSp | -0.189 | 0.257 | 0.000 | 0.089 | 0.093 | 0.771 | 0.843 | 0.431 | 0.301 | 0.114 | 0.834 | 0.426 | -0.053 | 0.137 | 0.189 | 1.000 | 0.166 | 0.296 | 0.270 |
| Survived | 0.217 | 0.132 | 0.294 | 0.154 | 0.339 | 0.199 | 0.204 | 0.266 | 0.283 | 0.616 | 0.167 | 0.117 | 0.024 | 0.315 | 0.536 | 0.166 | 1.000 | 0.551 | 0.633 |
| Title | 0.418 | 0.432 | 0.137 | 0.160 | 0.180 | 0.278 | 0.242 | 0.067 | 0.205 | 0.513 | 0.466 | 0.242 | 0.057 | 0.177 | 0.986 | 0.296 | 0.551 | 1.000 | 0.994 |
| TitleClass | 0.320 | 0.496 | 0.339 | 0.315 | 0.515 | 0.241 | 0.220 | 0.307 | 0.526 | 0.985 | 0.478 | 0.226 | 0.049 | 0.992 | 0.980 | 0.270 | 0.633 | 0.994 | 1.000 |
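To read a matrix of this size, it can help to list only the strongest pairs. The sketch below does that for a hand-copied subset of the rows above. The `strong_pairs` helper and the 0.5 cutoff are assumptions made for illustration: the cutoff is consistent with the "High correlation" alerts in this report, but the report does not state the threshold it used.

```python
# A few entries of the correlation matrix above, copied by hand (subset only).
corr = {
    "Deck": {"Pclass": 0.598, "Fare": 0.290, "Survived": 0.294},
    "Fare": {"FamilySize": 0.516, "Pclass": 0.489, "Survived": 0.266},
    "Survived": {"GenderClass": 0.616, "Sex": 0.536, "Pclass": 0.315},
}


def strong_pairs(matrix, threshold=0.5):
    """Return (row, column, value) triples whose |value| meets the threshold.

    The threshold is a hypothetical, illustrative cutoff.
    """
    pairs = []
    for row, cols in matrix.items():
        for col, value in cols.items():
            if row != col and abs(value) >= threshold:
                pairs.append((row, col, value))
    # Strongest associations first.
    return sorted(pairs, key=lambda p: abs(p[2]), reverse=True)


for row, col, value in strong_pairs(corr):
    print(f"{row} ~ {col}: {value:.3f}")
```

With this subset, Fare and Pclass (0.489) fall just under the cutoff, which matches the alert list naming only FamilySize for Fare.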
Missing values
Sample
| | PassengerId | Survived | Pclass | Name | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | Title | FamilySize | IsAlone | Deck | AgeGroup | FareGroup | EmbarkedClass | GenderClass | TitleClass | FamilyClass |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | not survived | 3 | Braund, Mr. Owen Harris | male | 22.0 | 1 | 0 | A/5 21171 | 7.2500 | NaN | S | Mr | 2 | 0 | U | Young Adult | Low | S3 | male3 | Mr3 | 23 |
| 1 | 2 | survived | 1 | Cumings, Mrs. John Bradley (Florence Briggs Thayer) | female | 38.0 | 1 | 0 | PC 17599 | 71.2833 | C85 | C | Mrs | 2 | 0 | C | Adult | High | C1 | female1 | Mrs1 | 21 |
| 2 | 3 | survived | 3 | Heikkinen, Miss. Laina | female | 26.0 | 0 | 0 | STON/O2. 3101282 | 7.9250 | NaN | S | Miss | 1 | 1 | U | Young Adult | Low | S3 | female3 | Miss3 | 13 |
| 3 | 4 | survived | 1 | Futrelle, Mrs. Jacques Heath (Lily May Peel) | female | 35.0 | 1 | 0 | 113803 | 53.1000 | C123 | S | Mrs | 2 | 0 | C | Young Adult | High | S1 | female1 | Mrs1 | 21 |
| 4 | 5 | not survived | 3 | Allen, Mr. William Henry | male | 35.0 | 0 | 0 | 373450 | 8.0500 | NaN | S | Mr | 1 | 1 | U | Young Adult | Medium-Low | S3 | male3 | Mr3 | 13 |
| 5 | 6 | not survived | 3 | Moran, Mr. James | male | 26.0 | 0 | 0 | 330877 | 8.4583 | NaN | Q | Mr | 1 | 1 | U | Young Adult | Medium-Low | Q3 | male3 | Mr3 | 13 |
| 6 | 7 | not survived | 1 | McCarthy, Mr. Timothy J | male | 54.0 | 0 | 0 | 17463 | 51.8625 | E46 | S | Mr | 1 | 1 | E | Adult | High | S1 | male1 | Mr1 | 11 |
| 7 | 8 | not survived | 3 | Palsson, Master. Gosta Leonard | male | 2.0 | 3 | 1 | 349909 | 21.0750 | NaN | S | Master | 5 | 0 | U | Child | Medium-High | S3 | male3 | Master3 | 53 |
| 8 | 9 | survived | 3 | Johnson, Mrs. Oscar W (Elisabeth Vilhelmina Berg) | female | 27.0 | 0 | 2 | 347742 | 11.1333 | NaN | S | Mrs | 3 | 0 | U | Young Adult | Medium-Low | S3 | female3 | Mrs3 | 33 |
| 9 | 10 | survived | 2 | Nasser, Mrs. Nicholas (Adele Achem) | female | 14.0 | 1 | 0 | 237736 | 30.0708 | NaN | C | Mrs | 2 | 0 | U | Teenager | Medium-High | C2 | female2 | Mrs2 | 22 |
| | PassengerId | Survived | Pclass | Name | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | Title | FamilySize | IsAlone | Deck | AgeGroup | FareGroup | EmbarkedClass | GenderClass | TitleClass | FamilyClass |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 702 | 703 | not survived | 3 | Barbara, Miss. Saiide | female | 18.0 | 0 | 1 | 2691 | 14.4542 | NaN | C | Miss | 2 | 0 | U | Teenager | Medium-Low | C3 | female3 | Miss3 | 23 |
| 703 | 704 | not survived | 3 | Gallagher, Mr. Martin | male | 25.0 | 0 | 0 | 36864 | 7.7417 | NaN | Q | Mr | 1 | 1 | U | Young Adult | Low | Q3 | male3 | Mr3 | 13 |
| 704 | 705 | not survived | 3 | Hansen, Mr. Henrik Juul | male | 26.0 | 1 | 0 | 350025 | 7.8542 | NaN | S | Mr | 2 | 0 | U | Young Adult | Low | S3 | male3 | Mr3 | 23 |
| 705 | 706 | not survived | 2 | Morley, Mr. Henry Samuel ("Mr Henry Marshall") | male | 39.0 | 0 | 0 | 250655 | 26.0000 | NaN | S | Mr | 1 | 1 | U | Adult | Medium-High | S2 | male2 | Mr2 | 12 |
| 706 | 707 | survived | 2 | Kelly, Mrs. Florence "Fannie" | female | 45.0 | 0 | 0 | 223596 | 13.5000 | NaN | S | Mrs | 1 | 1 | U | Adult | Medium-Low | S2 | female2 | Mrs2 | 12 |
| 707 | 708 | survived | 1 | Calderhead, Mr. Edward Pennington | male | 42.0 | 0 | 0 | PC 17476 | 26.2875 | E24 | S | Mr | 1 | 1 | E | Adult | Medium-High | S1 | male1 | Mr1 | 11 |
| 708 | 709 | survived | 1 | Cleaver, Miss. Alice | female | 22.0 | 0 | 0 | 113781 | 151.5500 | NaN | S | Miss | 1 | 1 | U | Young Adult | High | S1 | female1 | Miss1 | 11 |
| 709 | 710 | survived | 3 | Moubarek, Master. Halim Gonios ("William George") | male | 5.5 | 1 | 1 | 2661 | 15.2458 | NaN | C | Master | 3 | 0 | U | Child | Medium-High | C3 | male3 | Master3 | 33 |
| 710 | 711 | survived | 1 | Mayne, Mlle. Berthe Antonine ("Mrs de Villiers") | female | 24.0 | 0 | 0 | PC 17482 | 49.5042 | C90 | C | Rare | 1 | 1 | C | Young Adult | High | C1 | female1 | Rare1 | 11 |
| 711 | 712 | not survived | 1 | Klaber, Mr. Herman | male | 42.0 | 0 | 0 | 113028 | 26.5500 | C124 | S | Mr | 1 | 1 | C | Adult | Medium-High | S1 | male1 | Mr1 | 11 |
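The sample rows suggest that the four combined columns are built by appending Pclass to another column: EmbarkedClass from Embarked, GenderClass from Sex, TitleClass from Title, and FamilyClass from FamilySize (read back as an integer). This is inferred from the rows shown, not stated by the report, so the `combine` function below is a hypothetical reconstruction checked against a few of those rows.

```python
# Selected columns from four of the sample rows above (indices 0, 1, 7, 710).
rows = [
    {"Pclass": 3, "Sex": "male", "Embarked": "S", "Title": "Mr", "FamilySize": 2,
     "EmbarkedClass": "S3", "GenderClass": "male3", "TitleClass": "Mr3", "FamilyClass": 23},
    {"Pclass": 1, "Sex": "female", "Embarked": "C", "Title": "Mrs", "FamilySize": 2,
     "EmbarkedClass": "C1", "GenderClass": "female1", "TitleClass": "Mrs1", "FamilyClass": 21},
    {"Pclass": 3, "Sex": "male", "Embarked": "S", "Title": "Master", "FamilySize": 5,
     "EmbarkedClass": "S3", "GenderClass": "male3", "TitleClass": "Master3", "FamilyClass": 53},
    {"Pclass": 1, "Sex": "female", "Embarked": "C", "Title": "Rare", "FamilySize": 1,
     "EmbarkedClass": "C1", "GenderClass": "female1", "TitleClass": "Rare1", "FamilyClass": 11},
]


def combine(row):
    """Hypothetical reconstruction of the combined columns (an assumption)."""
    pclass = str(row["Pclass"])
    return {
        "EmbarkedClass": row["Embarked"] + pclass,
        "GenderClass": row["Sex"] + pclass,
        "TitleClass": row["Title"] + pclass,
        # FamilySize and Pclass are concatenated as digits, then read as an int.
        "FamilyClass": int(str(row["FamilySize"]) + pclass),
    }


# Count how many sample rows the reconstruction reproduces exactly.
matches = sum(
    all(row[name] == value for name, value in combine(row).items())
    for row in rows
)
print(f"{matches} of {len(rows)} sample rows reproduced")
```

If this reading is right, it also explains why FamilyClass is profiled as a numeric column with values such as 113 (FamilySize 11, Pclass 3), even though it behaves like a category code.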